Unusual Sub-sequence Identifications in Time Series with Periodicity
نویسندگان
چکیده
Fast and intelligent data mining has recently become an integral part of data analysis and a pre-requisite for modeling. This is largely due to the introduction of more sophisticated data collection tools and the possibility of observing large datasets at increased higher frequencies. This paper aims to investigate the current methodologies used for the detection of time series discord sub-sequences and especially those with periodicity. A strategy will be suggested to use classical data mining techniques and statistical decision making to take advantage of the special features of the time series to make the detection more efficient and more objective. An entropy-based measure will also be introduced as an alternative to the Euclidean distance measure for identifying discord sub-sequences.
منابع مشابه
The Effect of Rainfall Parameters on Runoff Using Wavelet Coherence Measure
In this research wavelet coherence measure is implemented for evaluating the relations and effect of rainfall parameters over many years on runoff fluctuations that is for testing proposed linkages between two time series. In this way, monthly Hydro climatological as 3 rainfall stations, one runoff in the outlet of Ardabil plain were used. The results illustrate that 8-12 and 8-16 month modes o...
متن کاملAN EXTENSIVE CENSUS OF HST COUNTERPARTS TO Chandra X-RAY SOURCES IN THE GLOBULAR CLUSTER 47 TUCANAE. I. ASTROMETRY AND PHOTOMETRY1
We report, in this study of 47 Tucanae, the largest number of optical identifications of X-ray sources yet obtained in a single globular cluster. Using deep Chandra/ACIS-I imaging and extensive HST studies with WFPC2 (including a 120 orbit program giving superb V and I images), we have detected optical counterparts to at least 22 cataclysmic variables (CVs) and 29 chromospherically active binar...
متن کاملPeriodicity Detection of Outlier Sequences Using Constraint Based Pattern Tree with MAD
Patterns that appear rarely or unusually in the data can be defined as outlier patterns. The basic idea behind detecting outlier patterns is comparison of their relative frequencies with frequent patterns. Their frequencies of appearance are less and thus have lesser support in the data. Detecting outlier patterns is an important data mining task which will reveal some interesting facts. The se...
متن کاملIdentification of Bifidobacterium Strains Isolated from Fecal Samples of Some Iranian Subjects Using 16SrRNA Gene Sequence Analysis and PCR-based Gene Specific Primers
For the first time in Iran 40 strains of Bifidobacterium were isolated from feces of Iranian subjects. By using phenotypic tests, 18 isolates were identified as Bifidobacterium longum, 10 as Bifidobacterium bifidum and one as Bifidobacterium catenolatum. In order to validate these results and also to identify other isolates that had not been identified by phenotypic tests, two methods of PCR wi...
متن کاملOn the Detection of Trends in Time Series of Functional Data
A sequence of functions (curves) collected over time is called a functional time series. Functional time series analysis is one of the popular research areas in which statistics from such data are frequently observed. The main purpose of the functional time series is to predict and describe random mechanisms that resulted in generating the data. To do so, it is needed to decompose functional ti...
متن کامل